LINEAR PEPTIDES DERIVED FROM ANTIBIOTIC PEPTIDES,
PREPARATION AND USE FOR VECTORING ACTIVE SUBSTANCES



The invention concerns linear peptides derived from antibiotic peptides and their use for vectoring active substances. More particularly, the subject of the invention is new compounds formed from a linear derivative of an antibiotic peptide coupled to at least one active substance, together with the preparation of these compounds and compositions containing them.

In addition to the immune system responsible for specific defence mechanisms against infectious agents, vertebrates have numerous peptides with anti-microbial activity (Nicolas P. et al., 1995, Annual Rev. Microbiol. 49, 277-304). Such peptides are the only defence of this kind in invertebrates, which have a short lifetime and a high renewal rate and for which a memory immune system, slow to form and to develop an appropriate response, is ill-adapted.

The anti-microbial peptides of vertebrates, irrespective of their origin (lower or higher vertebrates, myeloid or non-myeloid tissue), have a certain number of properties in common :

- high basicity due to the presence of numerous arginines and lysines,

- the ability to form amphipathic structures. By amphipathic structure is meant a structure in which the hydrophobic residues are separated in space from the hydrophilic residues,

- a very wide activity spectrum. They are able to rapidly destroy bacteria (Gram+ and Gram-), fungi, a few protozoa, membrane viruses and even some cancer cell lines.

According to their structure, antibiotic peptides can be divided into three major families :

- amphipathic α-helical antibiotic peptides : cecropins and magainins (Maloy W.L. et al., 1995, Biopolymers 37, 105-122),

- β-stranded antibiotic peptides linked by disulphide bonds : defensins (Lehrer R.I. et al., 1991, Cell 64:229-230 ; Lehrer R.I. et al., 1993, Ann. Rev. Immunol. 11:105-128), protegrins (Kokryakov V.N. et al., 1993, FEBS Lett. 327:231-236), tachyplesins (Nakamura T. et al., 1988, J. Biol. Chem. 263:16709-16713 ; Miyata T. et al., 1989, J. Biochem. 106:663-668),

- antibiotic peptides having unstructured chains with many bends due to the presence of multiple prolines : bactenecins and PR39 (Frank R.W. et al., 1991, Eur. J. Biochem. 202, 849-854).

Despite the diversity of their sequences, most antibiotic peptides act by direct lysis of the membrane of pathogenic cells. Their basicity promotes their interaction with negatively charged phospholipids and, being amphipathic, they are subsequently able to insert into the membrane, in which they aggregate to form pores through which the cell loses its contents. It is generally accepted that their preferential selectivity for prokaryotic cells is due to the special composition of prokaryotic membranes, which contain more anionic phospholipids than those of eukaryotes. Also, the plasma membranes of mammalian cells all contain cholesterol, whose role is to modulate their fluidity and which could hinder the incorporation of antibiotic peptides. However, the specificity of these peptides for micro-organisms is low, meaning that they show strong cytotoxicity, which limits their use.

The presence of antibiotic peptides in vertebrates, and more particularly in mammals, raises numerous questions. Immunologists assume that the compounds having non-specific anti-microbial activity found in invertebrates constitute an ancestral means of defence which later developed, leading to much more complex memory systems. What, therefore, is the advantage, in mammals for example, of having preserved some peptides with antibiotic activity? It is supposed that these small molecules, which are always present in biological fluids or sequestered in some lymphocyte structures, could form a first line of defence while awaiting the secretion of specific antibodies (Nicolas P. et al., 1995, Annual Rev. Microbiol. 49, 277-304). They could also, within the macrophages, take part in the destruction of the plasma membranes of pathogenic organisms.

Regardless of their exact role, antibiotic peptides are of considerable interest owing to their wide spectrum of activity and the difficulty encountered by micro-organisms in setting up inactivation strategies. On this account, a great many research studies have been conducted to find new molecules and to obtain analogues that perform better than the parent peptides. It is possible that in the future these antibiotic peptides will be called upon to replace the antibiotics derived from bacteria or fungi. For example, PCT international patent applications published under numbers WO95/03325, WO96/37508 and WO97/02287 describe a new class of antibiotic peptides called "protegrins", isolated from porcine leukocytes or prepared by chemical synthesis or genetic engineering, and having antibacterial, antiviral and antifungal activities.

At the present time, β-stranded antibiotic peptides linked by disulphide bonds (defensins, protegrins, tachyplesins) are a particular subject of research on account of their powerful anti-microbial activity (bacteria, some viruses, fungi and parasites). Within this family, protegrins and tachyplesins are certainly the most promising molecules given the simplicity of their structure and the relative ease with which they can be synthesised.

The name protegrins denotes a group of five closely related peptides, called PG-1, PG-2, PG-3, PG-4 and PG-5, isolated from porcine leukocytes (Kokryakov V.N. et al., 1993, FEBS Lett. 327, 231-236), whose sequences are given below :

PG-1 : Arg Gly Gly Arg Leu Cys Tyr Cys Arg Arg Arg Phe Cys Val Cys Val Gly Arg -NH2 (SEQ ID NO : 1)

PG-2 : Arg Gly Gly Arg Leu Cys Tyr Cys Arg Arg Arg Phe Cys Ile Cys Val -NH2 (SEQ ID NO : 2)

PG-3 : Arg Gly Gly Gly Leu Cys Tyr Cys Arg Arg Arg Phe Cys Val Cys Val Gly Arg -NH2 (SEQ ID NO : 3)

PG-4 : Arg Gly Gly Arg Leu Cys Tyr Cys Arg Gly Trp Ile Cys Phe Cys Val Gly Arg -NH2 (SEQ ID NO : 4)

PG-5 : Arg Gly Gly Arg Leu Cys Tyr Cys Arg Pro Arg Phe Cys Val Cys Val Gly Arg -NH2 (SEQ ID NO : 5)

Tachyplesins (Tamura H. et al., 1993, Chem. Pharm. Bull. Tokyo 41, 978-980), denoted T1, T2 and T3, and polyphemusins (Muta T., 1994, CIBA Found. Symp. 186, 160-174), denoted P1 and P2, whose sequences are given below, are homologous peptides isolated from the hemolymph of two crabs, Tachypleus tridentatus for tachyplesins T1, T2 and T3, and Limulus polyphemus for polyphemusins P1 and P2.

P1 : Arg Arg Trp Cys Phe Arg Val Cys Tyr Arg Gly Phe Cys Tyr Arg Lys Cys Arg -NH2 (SEQ ID NO : 6)

P2 : Arg Arg Trp Cys Phe Arg Val Cys Tyr Lys Gly Phe Cys Tyr Arg Lys Cys Arg -NH2 (SEQ ID NO : 7)

T1 : Lys Trp Cys Phe Arg Val Cys Tyr Arg Gly Ile Cys Tyr Arg Arg Cys Arg -NH2 (SEQ ID NO : 8)

T2 : Arg Trp Cys Phe Arg Val Cys Tyr Arg Gly Ile Cys Tyr Arg Lys Cys Arg -NH2 (SEQ ID NO : 9)

T3 : Lys Trp Cys Phe Arg Val Cys Tyr Arg Gly Ile Cys Tyr Lys Arg Cys Arg -NH2 (SEQ ID NO : 10)
Protegrins, tachyplesins and polyphemusins contain a high proportion of basic residues (lysines and arginines) and have four cysteines which form two parallel disulphide bonds. These three families of peptides also show homologies with some defensins, in particular with the human defensin NP-1 (Kokryakov V.N. et al., 1993, FEBS Lett. 327, 231-236).
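As an informal illustration of these two shared features, the short Python sketch below counts the basic residues (Arg, Lys) and the cysteines in three of the sequences listed above. It is not part of the original disclosure: the function name `composition` and the dictionary layout are assumptions made for the example, and the C-terminal -NH2 group is omitted.

```python
# Sequences copied from the listing above (three-letter codes, -NH2 omitted).
SEQUENCES = {
    "PG-1": "Arg Gly Gly Arg Leu Cys Tyr Cys Arg Arg Arg Phe Cys Val Cys Val Gly Arg",
    "P1": "Arg Arg Trp Cys Phe Arg Val Cys Tyr Arg Gly Phe Cys Tyr Arg Lys Cys Arg",
    "T1": "Lys Trp Cys Phe Arg Val Cys Tyr Arg Gly Ile Cys Tyr Arg Arg Cys Arg",
}

# Residues counted as basic in the text: arginines and lysines.
BASIC = {"Arg", "Lys"}


def composition(sequence):
    """Return length, basic-residue count, cysteine count and basic fraction."""
    residues = sequence.split()
    basic = sum(1 for residue in residues if residue in BASIC)
    return {
        "length": len(residues),
        "basic": basic,
        "cysteines": residues.count("Cys"),
        "basic_fraction": basic / len(residues),
    }


for name, seq in SEQUENCES.items():
    print(name, composition(seq))
```

Under these assumptions, each of the three sequences contains four cysteines, and at least one third of its residues are basic.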

Tachyplesins and protegrins have closely similar three-dimensional structures : an anti-parallel β strand stabilised by the two disulphide bonds. These bonds play an important role in the antibacterial activity of protegrins and tachyplesins. Their removal, either by protecting the SH groups with acetamidomethyl groups or by replacing the cysteines with alanines or glycines, yields analogues virtually devoid of in vivo activity (Lehrer R.I. et al., 1996, Eur. J. Biochem. 240:352-357).

As previously indicated, protegrins and tachyplesins have substantial lytic activity on prokaryotic cells. Research work conducted by the Applicant on the cytotoxicity of these peptides on cultured mammalian cells has shown that, prior to the death of the cells, non-negligible quantities of protegrins and tachyplesins are present in the cytoplasm of said cells. It was considered that the presence of peptides in the cytoplasm could be the outcome of transport via pores, but these pores are only permeable to ions and small molecules, and their diameter is too small to give passage to antibiotic peptides. It would therefore seem that protegrins and tachyplesins, in addition to perforating the plasma membrane, are able to pass through it.

The cytotoxicity and antimicrobial activity of protegrins and tachyplesins are known to derive from their ability to aggregate inside the membrane to form multimeric channels (Mangoni M. et al., 1996, FEBS Lett. 383, 93-98). The Applicant therefore considered that this aggregation might be connected with the tertiary structure of these antibiotic peptides, which comprise several cysteine residues, and prepared linear derivatives of protegrins and tachyplesins in which the cysteines are replaced by various natural amino acids. These peptides were coupled, at their N-terminal end, to a fluorescent molecule or to biotin, and the distribution of these markers inside the cell was observed by confocal microscopy.

In this way, it was found that these peptides are non-toxic and have no lytic activity but are, on the other hand, able to pass rapidly through the membranes of mammalian cells via a passive mechanism.

These linear derivatives of antibiotic peptides 
therefore constitute a new, non-toxic, system for 
vectoring active substances. 

By vectoring system is meant, according to the invention, a process capable of conveying an active substance to a target, for example :

- causing an active substance to pass through the cell membrane and allowing the distribution of said substance in the cytoplasm and/or in the nuclear compartment,

- bringing an active substance to a particular organ, for example causing this active substance to pass through the blood-brain barrier,

- forcing this active substance to interact specifically with a given cell type, erythrocytes for example.

The subject of the present invention is therefore peptides derived from antibiotic peptides or analogues thereof, characterised in that they are devoid of a disulphide bond.

By analogue of an antibiotic peptide is meant a peptide whose amino acid sequence has been modified without causing any modification in the antibiotic properties of said peptide.

The absence of a disulphide bond in the peptides of the invention may be obtained by any means known to those skilled in the art, for example by :

- removing, or replacing with other amino acids, the cysteine residues of the antibiotic peptide sequence,

- blocking the -SH groups of the cysteine residues such that they do not form a disulphide bond,

provided, evidently, that the peptide obtained has vectoring properties and is not toxic for the previously described cells.

These modifications may be made during the preparation of the peptides of the invention, more particularly by chemical synthesis or by expression of a gene coding for said peptide, or directly on an antibiotic peptide through the action of chemical agents that open the disulphide bonds and block the -SH groups of the cysteine residues.

The above modifications advantageously concern all the cysteine residues of the antibiotic peptide but, insofar as a single cysteine residue does not allow the formation of a disulphide bond, the peptides of the invention may contain a single cysteine. Natural antibiotic peptides generally have 4 or 6 cysteine residues able to form two or three disulphide bonds ; in the peptides of the invention, therefore, only one of these cysteines can be retained and the three or five others are modified or blocked.
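The replacement route described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the helper name `linearise`, its `replacement` and `keep` parameters, and the choice of serine as the default replacement are hypothetical, although Cys to Ser substitution of PG-1 does reproduce the SM1738 sequence of Table I.

```python
def linearise(sequence, replacement="Ser", keep=None):
    """Replace cysteines so that no disulphide bond can form.

    sequence    -- space-separated three-letter residue codes
    replacement -- residue substituted for each cysteine (assumed default: Ser)
    keep        -- optional 0-based rank of the one cysteine to retain
    """
    result = []
    cysteines_seen = 0
    for residue in sequence.split():
        if residue == "Cys":
            if keep is not None and cysteines_seen == keep:
                result.append("Cys")  # a single cysteine cannot pair up
            else:
                result.append(replacement)
            cysteines_seen += 1
        else:
            result.append(residue)
    return " ".join(result)


PG1 = "Arg Gly Gly Arg Leu Cys Tyr Cys Arg Arg Arg Phe Cys Val Cys Val Gly Arg"

print(linearise(PG1))          # all four cysteines replaced by serine
print(linearise(PG1, keep=0))  # first cysteine retained, the other three replaced
```

Retaining at most one cysteine mirrors the condition stated above: with four or six cysteines in the parent peptide, three or five are replaced or blocked.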
The antibiotic peptides from which the peptides of the invention are derived may be defensins, protegrins, tachyplesins or their analogues, whose antibiotic properties are imparted to them by their tertiary structure resulting from the presence of disulphide bonds.

Linear peptides of the invention meet one of the following formulas :

Baa Xaa Xaa Baa Xaa Xaa Xaa Xaa Baa Baa Baa Xaa Xaa Xaa Xaa Xaa Xaa Baa (I) (SEQ ID NO : 11)

Baa Baa Xaa Xaa Xaa Baa Xaa Xaa Xaa Baa Xaa Xaa Xaa Xaa Baa Baa Xaa Baa (II) (SEQ ID NO : 12)

which may also be represented by the following single formula (III) :

Baa (Xaa Baa) Xaa (Xaa Baa) Xaa (Xaa Baa) Xaa Xaa (Xaa Baa) Baa (Xaa Baa) Xaa Xaa Xaa (Xaa Baa) (Xaa Baa) Xaa Baa (SEQ ID NO : 38)

in which :

- the Baa groups, identical or different, represent an amino acid residue whose side chain carries a basic group, and

- the Xaa groups, identical or different, represent an aliphatic or aromatic amino acid residue,

or are made up of a sequence of at least 5, preferably at least 7, successive amino acids of either of formulas (I) or (II), provided this sequence has vectoring properties and is not toxic for the previously described cells.

Baa and Xaa may or may not be natural amino acids, and include D-amino acids.

As an example, the following meanings of Baa and Xaa may be cited :

- Baa is chosen from among arginine, lysine, diaminoacetic acid, diaminobutyric acid, diaminopropionic acid and ornithine.

- Xaa is chosen from among glycine, alanine, valine, norleucine, isoleucine, leucine, cysteine, cysteine(Acm), penicillamine, methionine, serine, threonine, asparagine, glutamine, phenylalanine, histidine, tryptophan, tyrosine, proline, aminobutyric acid, carboxylic amino-1-cyclohexane acid, aminoisobutyric acid, carboxylic 2-aminotetraline, 4-bromophenylalanine, tert-leucine, 4-chlorophenylalanine, β-cyclohexylalanine, 3,4-dichlorophenylalanine, 4-fluorophenylalanine, homoleucine, β-homoleucine, homophenylalanine, 4-methylphenylalanine, 1-naphthylalanine, 2-naphthylalanine, 4-nitrophenylalanine, 3-nitrotyrosine, norvaline, phenylglycine, 3-pyridylalanine and β-(2-thienyl)-alanine.
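To make the pattern behind formulas (I) and (II) concrete, the following Python sketch reduces a sequence to a string of B (basic) and X (other) positions and compares it with each formula, or with a run of at least five successive positions of it. Several points are assumptions made for the illustration only: the names `pattern`, `matches` and `FORMULA_III`, the restriction of Baa to Arg, Lys and Orn, the simplification that every non-basic residue counts as Xaa, and the reading of each "(Xaa Baa)" in formula (III) as a position that may hold either kind of residue. Formula (II) is written here as read from the text.

```python
# B/X skeletons of formulas (I) and (II), one letter per position.
FORMULA_I = "B X X B X X X X B B B X X X X X X B".split()
FORMULA_II = "B B X X X B X X X B X X X X B B X B".split()

# Assumed subset of the Baa examples; everything else is treated as Xaa.
BASIC = {"Arg", "Lys", "Orn"}


def pattern(sequence):
    """Map each residue to 'B' (basic) or 'X' (any other residue)."""
    return ["B" if residue in BASIC else "X" for residue in sequence.split()]


def matches(sequence, formula, min_len=5):
    """True if the B/X pattern equals the formula or a run of at least
    min_len successive positions of it."""
    observed = pattern(sequence)
    size = len(observed)
    if size < min_len or size > len(formula):
        return False
    return any(formula[start:start + size] == observed
               for start in range(len(formula) - size + 1))


# Position-wise merge of (I) and (II): 'X/B' where the two formulas differ.
FORMULA_III = [a if a == b else "X/B" for a, b in zip(FORMULA_I, FORMULA_II)]

SM1738 = "Arg Gly Gly Arg Leu Ser Tyr Ser Arg Arg Arg Phe Ser Val Ser Val Gly Arg"
SM1726 = "Lys Trp Ser Phe Arg Val Ser Tyr Arg Gly Ile Ser Tyr Arg Arg Ser Arg"

print(matches(SM1738, FORMULA_I))   # protegrin-derived, 18 residues
print(matches(SM1726, FORMULA_II))  # tachyplesin-derived, 17 residues
print(" ".join(FORMULA_III))
```

With these assumptions, the merged list has a two-way choice at exactly the seven positions where formula (III) shows a bracketed pair, which is what motivates the reading of the brackets adopted here.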

The invention also concerns derivatives of the peptides of formula (I) or (II), such as said peptides in retro form, or moieties of the peptides of formula (I) or (II) made up of at least five, preferably at least seven, successive amino acids of either formula (I) or (II).
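Both kinds of derivative lend themselves to a brief sketch: the retro form is the sequence read in the opposite direction, and the moieties are runs of successive residues. The Python below is illustrative only; the function names `retro` and `fragments` are hypothetical, and the sequences are those given for SM1726 in Table II and for SM2193 and SM2195 in Table I.

```python
def retro(sequence):
    """Return the retro form: the same residues in reverse order."""
    return " ".join(reversed(sequence.split()))


def fragments(sequence, min_len=5):
    """List every run of at least min_len successive residues."""
    residues = sequence.split()
    total = len(residues)
    return [" ".join(residues[start:end])
            for start in range(total)
            for end in range(start + min_len, total + 1)]


SM1726 = "Lys Trp Ser Phe Arg Val Ser Tyr Arg Gly Ile Ser Tyr Arg Arg Ser Arg"
SM2193 = "Arg Arg Leu Ser Tyr Ser Arg Arg Arg Phe Ser Val Ser Val Arg"
SM2195 = "Arg Arg Leu Ser Tyr Ser Arg Arg Arg Phe"

print(retro(SM1726))                 # compare with SM2310 in Table II
print(SM2195 in fragments(SM2193))   # N-terminal moiety of SM2193
print(len(fragments(SM1726)))        # number of candidate moieties
```

Whether a given moiety actually retains vectoring properties is an experimental question; the enumeration only lists the candidates allowed by the length condition.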

Among the peptides of the invention, special mention can be made of those meeting the following formulas :

Arg Xaa Xaa Arg Xaa Uaa Xaa Uaa Arg Arg Arg Xaa Uaa Xaa Uaa Xaa Xaa Arg -NH2 (V) (SEQ ID NO : 13)

Arg Arg Xaa Uaa Xaa Arg Xaa Uaa Xaa Arg Xaa Xaa Uaa Xaa Arg Arg Uaa Arg -NH2 (VI) (SEQ ID NO : 14)

in which :

- Uaa represents serine or threonine,

- the Xaa groups, identical or different, represent an amino acid which may or may not be natural (including D-amino acids), either aliphatic or aromatic, such as glycine, alanine, valine, norleucine, isoleucine, leucine, cysteine, cysteine(Acm), penicillamine, methionine, serine, threonine, asparagine, glutamine, phenylalanine, histidine, tryptophan, tyrosine, proline, aminobutyric acid, carboxylic amino-1-cyclohexane acid, aminoisobutyric acid, carboxylic 2-aminotetraline, 4-bromophenylalanine, tert-leucine, 4-chlorophenylalanine, β-cyclohexylalanine, 3,4-dichlorophenylalanine, 4-fluorophenylalanine, homoleucine, β-homoleucine, homophenylalanine, 4-methylphenylalanine, 1-naphthylalanine, 2-naphthylalanine, 4-nitrophenylalanine, 3-nitrotyrosine, norvaline, phenylglycine, 3-pyridylalanine and β-(2-thienyl)-alanine.

Among the peptides of formulas (I) and (II) or their derivatives, the invention specifically considers those derived from protegrins and tachyplesins referred to in tables I and II below.



Table I : Protegrin derivatives

Code     Sequence                                Modification

SM1738   Arg Gly Gly Arg Leu Ser Tyr Ser Arg     Head of series
         Arg Arg Phe Ser Val Ser Val Gly Arg
         (SEQ ID NO : 15)

SM1736   Arg Gly Gly Arg Leu Ser Tyr Ser Arg     Amino acids in D form (of SM1738)
         Arg Arg Phe Ser Val Ser Val Gly Arg
         (SEQ ID NO : 16)

SM1727   Arg Gly Val Ser Val Ser Phe Arg Arg     Retro form of SM1738
         Arg Ser Tyr Ser Leu Arg Gly Gly Arg
         (SEQ ID NO : 17)

SM1739   Glu Gly Gly Glu Leu Ser Tyr Ser Glu     Reversed charge (R -> E)
         Glu Glu Phe Ser Val Ser Val Gly Glu
         (SEQ ID NO : 18)

SM2187   Arg Gly Gly Arg Leu Ala Tyr Arg Leu     Increased amphipathicity
         Leu Arg Phe Ala Ile Arg Val Gly Arg
         (SEQ ID NO : 19)

SM2188   Oaa Gly Gly Oaa Xaa Xaa Baa Oaa Xaa     Increased hydrophobicity
         Xaa Oaa Baa Xaa Xaa Xaa Oaa Xaa Gly
         (SEQ ID NO : 20)

SM2189   Arg Ala Ala Arg Leu Gly Tyr Arg Xaa     Increased amphipathicity
         Xaa Arg Phe Gly Zaa Arg Val Gly Arg
         (SEQ ID NO : 21)

SM2194   Tyr Arg Arg Arg Phe Ser Val Ser Val     C-terminal end of SM2193
         Arg
         (SEQ ID NO : 22)

SM2195   Arg Arg Leu Ser Tyr Ser Arg Arg Arg     N-terminal end of SM2193
         Phe
         (SEQ ID NO : 23)

SM2193   Arg Arg Leu Ser Tyr Ser Arg Arg Arg     Reduced flexibility (G deletion)
         Phe Ser Val Ser Val Arg
         (SEQ ID NO : 24)

SM2196   Arg Gly Gly Arg Leu Ser Tyr Ser Arg     Inhibition of dimerisation
         Arg Arg Phe Ser Thr Ser Thr Gly Arg
         (SEQ ID NO : 25)



Table II : Tachyplesin derivatives

Code     Sequence                                Modification

SM1726   Lys Trp Ser Phe Arg Val Ser Tyr Arg     Head of series
         Gly Ile Ser Tyr Arg Arg Ser Arg
         (SEQ ID NO : 26)

SM2307   Arg Trp Ser Phe Arg Val Ser Tyr Arg     K -> R mutation
         Gly Ile Ser Tyr Arg Arg Ser Arg
         (SEQ ID NO : 27)

SM2392   Arg Trp Ser Phe Arg Val Ser Tyr Arg     Amino acids in D form (of SM2307)
         Gly Ile Ser Tyr Arg Arg Ser Arg
         (SEQ ID NO : 28)

SM2309   Lys Trp Ser Phe Arg Val Ser Tyr Arg     Amino acids in D form (of SM1726)
         Gly Ile Ser Tyr Arg Arg Ser Arg
         (SEQ ID NO : 29)

SM2310   Arg Ser Arg Arg Tyr Ser Ile Gly Arg     Retro form
         Tyr Ser Val Arg Phe Ser Trp Lys
         (SEQ ID NO : 30)

SM2190   Oaa Baa Xaa Baa Oaa Xaa Xaa Baa Oaa     Increased hydrophobicity
         Gly Xaa Oaa Baa Xaa Xaa Oaa Xaa
         (SEQ ID NO : 31)

SM2191   Lys Trp Ala Phe Arg Val Ala Tyr Arg     Increased amphipathicity
         Gly Ile Arg Tyr Leu Leu Arg Leu
         (SEQ ID NO : 32)

SM2192   Lys Tyr Ala Trp Arg Val Ala His Arg     Increased amphipathicity
         Gly Ile Arg Trp Leu Leu Arg Xaa
         (SEQ ID NO : 33)

In the sequences of tables I and II above, Baa 
represents Naphthylalanine, Oaa represents Ornithine, Xaa 
represents Norleucine and Zaa represents Norvaline. 

The invention also concerns the use of the above peptides for vectoring one or more active substances, both for therapeutic and for diagnostic applications. As active substance, the invention gives particular consideration to proteins or moieties of proteins, such as polypeptides or peptides, antibodies or parts of antibodies, nucleic acids and oligonucleotides or ribozymes, or even, obviously, active chemical molecules for the treatment or prevention of human or animal pathologies, such as, but not restricted to, anti-tumour agents, antivirals, anti-inflammatories and agents preventing the degradation of organs and/or tissues.

In the field of diagnostics, the active substance may be a radioactive marker, a stain marker, or any other means or substance able to reveal a metabolism or a pathology.

A further purpose of the invention is therefore compounds of formula (IV) as follows and compositions containing them :

(Y)n - A - (Z)m     (IV)

in which :

- A represents a linear peptide derived from an antibiotic peptide in accordance with the invention,

- Z represents an active substance, such as defined above,

- Y represents a signal agent,

- n is 0 or more, advantageously 0 or 1,

- m is 1 or more, preferably up to 10, advantageously up to 5.
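The constraints on n and m can be captured in a small data structure. The Python sketch below is one hypothetical way of modelling a formula (IV) compound; the class name `Compound`, its field names and the text returned by `formula` are assumptions made for the illustration and do not come from the original text.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Compound:
    """Hypothetical model of a formula (IV) compound: signal agents (Y),
    a linear peptide (A) and active substances (Z)."""

    peptide: str                     # A: linear peptide sequence
    active_substances: List[str]     # Z groups, m >= 1
    signal_agents: List[str] = field(default_factory=list)  # Y groups, n >= 0

    def __post_init__(self):
        # Formula (IV) requires at least one active substance.
        if len(self.active_substances) < 1:
            raise ValueError("formula (IV) requires m >= 1 active substance")

    @property
    def n(self):
        return len(self.signal_agents)

    @property
    def m(self):
        return len(self.active_substances)

    def formula(self):
        """Render the compound with its actual n and m values."""
        return f"(Y){self.n} - A - (Z){self.m}"


conjugate = Compound(
    peptide="Arg Gly Gly Arg Leu Ser Tyr Ser Arg Arg Arg Phe Ser Val Ser Val Gly Arg",
    active_substances=["doxorubicin"],
    signal_agents=["nuclear localising signal"],
)
print(conjugate.formula())
```

The upper bounds mentioned above (m preferably up to 10, advantageously up to 5) are preferences rather than requirements, so the sketch deliberately enforces only the lower bound on m.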

Therefore, the above formula (IV) compounds are formed from a peptide of the invention coupled with one or more active substances, identical or different, represented by the (Z) group in formula (IV), and optionally one or more signal agents, represented by the (Y) group in formula (IV), which address the compound of formula (IV) towards a cell type, a site or cell compartment, or a given tissue. More particularly, the signal agent (Y) is an oligopeptide or a protein, such as a signal peptide, a nuclear localising signal or an antibody moiety, or a chemical molecule that is a ligand or anti-ligand of a receptor.

In a special embodiment of the compounds of formula (IV), group (Y) is fixed to group (Z).

This coupling, symbolised by the horizontal lines in formula (IV), may be made by any acceptable linking means, taking into consideration the chemical nature, the size and the number of groups (Z) and (Y) in the compounds of formula (IV), such as covalent, hydrophobic or ionic bonds, which may or may not be cleavable in physiological media. Coupling may be made at any site of peptide (A) at which functional groups such as -OH, -SH, -COOH or -NH2 are naturally present or have been inserted.

The invention also provides for the fixation of several (Z) groups to one and the same site of peptide (A), either directly, if this site comprises several functional groups as is the case for a C- or N-terminal lysine, or indirectly via an intermediate group carrying several reactive groups enabling the fixation of several (Z) groups.

The preferred coupling positions for the active substance are at the N-terminal and C-terminal ends or at the primary amino groups carried by the side chains of the lysines of peptide (A). If the C-terminal end of peptide (A) is used to attach active substance (Z), the N-terminal end is available for optional coupling to a signal agent (Y) enabling the compound of the invention to be addressed either towards the nucleus or towards a given tissue type.

For example, if the C-terminal end of a linear peptide of the invention is used to couple an active substance made up of a fluorescent marker, or biotin, or a medicinal molecule such as doxorubicin, the covalent peptide-drug complex distributes itself, after administration, within the cytoplasm of the target cell. It is possible to bring this complex into the nuclear compartment by using the N-terminal end of the peptide to couple a short basic sequence, for example of around 7 amino acids, corresponding to a nuclear localising signal. Under these conditions, the biotin or doxorubicin is found in the cell nucleus.

In the same way, it is possible to vector a drug towards a given cell type by using the N-terminal end of the linear peptide of the invention, coupled at its C-terminal end to a medicinal agent, to add a peptide sequence able to specifically recognise a determinant present on the surface of that cell type. For example, the synthetic pentadecapeptide αM2 (Sivolapenko G.B. et al., 1995, The Lancet 346, 1662-65), a moiety of a monoclonal antibody directed against an antigen expressed by breast cancer cells (Tumour Associated Antigen Polymorphic Epithelial Mucin), maintains good affinity for these cells. It is therefore possible, by associating αM2 with a linear peptide-medicinal agent complex, to direct this assembly preferentially towards the cells which express the antigen characteristic of breast cancer.

The compounds of formula (IV) may be prepared by chemical synthesis or by using molecular biology techniques.

For chemical syntheses, commercially available equipment can be used which allows the incorporation of non-natural amino acids, such as D enantiomers and residues with side chains differing in hydrophobicity and size from those of their natural homologues. At the time of synthesis it is evidently possible to make a wide range of modifications, for example to insert a lipid (prenyl or myristyl) at the N-terminal end so as to be able to anchor the peptide of the invention, and hence the formula (IV) compound, to a lipid membrane such as that of a liposome made up of positively charged lipids. It is also possible to replace one or more peptide bonds (-CO-NH-) by equivalent structures such as -CO-N(CH3)-, -CH2-CH2- or -CO-CH2-, or to interpose groups such as -CH2-, -NH- or -O-.

It is also possible to obtain formula (IV) compounds, or the part thereof having a protein nature, from an encoding nucleic acid sequence. A further purpose of the invention is a nucleic acid molecule comprising or made up of a nucleic sequence coding for a linear peptide derived from an antibiotic peptide. More particularly, the invention concerns a nucleic acid molecule comprising at least one sequence coding for a formula (IV) compound or the part thereof having a protein nature. These nucleic acid sequences may be DNAs or RNAs and may be associated with control sequences and/or inserted in vectors. The vector used is chosen in relation to the host to which it will be transferred ; it may be any vector, such as a plasmid. These nucleic acids and vectors are useful for producing the linear peptides and formula (IV) compounds, or the part of the latter having a protein nature, in a host cell. The preparation of these vectors and the production or expression in a host of linear peptides or formula (IV) compounds may be conducted using molecular biology and genetic engineering techniques well known to those skilled in the art.

By way of example, a method for producing a peptide of the invention consists of :

- transferring a nucleic acid molecule, or a vector containing said molecule, into a host cell,

- culturing said host cell under conditions enabling the production of the peptide,

- isolating, by any appropriate means, the peptides of the invention.

The host cell used in this type of method may be chosen from among prokaryotes or eukaryotes, in particular from among bacteria, yeasts, mammalian, plant or insect cells. The invention therefore also concerns transformed cells expressing the linear peptides or formula (IV) compounds, or the part of the latter having a protein nature.

The invention also relates to :

- pharmaceutical compositions comprising as active ingredient at least one formula (IV) compound, optionally associated with an acceptable vehicle or carrier,

- diagnostic agents containing at least one formula (IV) compound.

Other characteristics and advantages of the invention will become apparent in the following description concerning the preparation of formula (IV) compounds and the research work which led to revealing the vectoring properties of the linear peptides of the invention derived from antibiotic peptides.

Example 1 : Fixing biotin and doxorubicin onto linear analogues of antibiotic peptides.

1) Preparation of linear peptides

The three peptides with the sequences given below were synthesised :

Arg Gly Gly Arg Leu Xaa Tyr Xaa Arg Arg Arg Phe Xaa Val Xaa Val Gly Arg-NH2 (SEQ ID NO : 34)

Arg Arg Trp Xaa Phe Arg Val Xaa Tyr Arg Gly Phe Xaa Tyr Arg Lys Xaa Arg-NH2 (SEQ ID NO : 35)

Lys Trp Xaa Phe Arg Val Xaa Tyr Arg Gly Ile Xaa Tyr Arg Arg Xaa Arg-NH2 (SEQ ID NO : 36)

in which Xaa represents a serine, threonine or alanine residue.

These peptides are respectively derived from the sequences of Protegrin PG-1 having the formula :

Arg Gly Gly Arg Leu Cys Tyr Cys Arg Arg Arg Phe Cys Val Cys Val Gly Arg -NH2 (SEQ ID NO : 1)

of Polyphemusin 1 having the formula :

Arg Arg Trp Cys Phe Arg Val Cys Tyr Arg Gly Phe Cys Tyr Arg Lys Cys Arg -NH2 (SEQ ID NO : 6)

and of Tachyplesin 1 having the formula :

Lys Trp Cys Phe Arg Val Cys Tyr Arg Gly Ile Cys Tyr Arg Arg Cys Arg -NH2 (SEQ ID NO : 8)
These three peptides may equally be prepared using either BOC chemistry or FMOC chemistry, by conventional synthesis methods in solid or homogeneous phase.

2) Fixing biotin onto linear peptides

The peptide is synthesised in solid phase and, after incorporation of the N-terminal arginine, 5-aminopentanoic acid is added. The FMOC or BOC N-terminal protecting group is removed and, with the peptide still attached to the resin, N-hydroxysuccinimide biotin ester is reacted with it in dimethylformamide. After 15 hours' reaction at room temperature, the biotinylated peptide is cleaved from the support through the action of trifluoroacetic acid or hydrofluoric acid, following well-established protocols in peptide chemistry. The peptide is then purified by high pressure liquid chromatography.

3) Fixing doxorubicin onto a linear peptide

To fix doxorubicin, the peptide having the following formula is prepared by solid phase synthesis :

Arg Gly Gly Arg Leu Xaa Tyr Xaa Arg Arg Arg Phe Xaa Val Xaa Val Gly Arg-NH2 (SEQ ID NO : 37)

After cleavage from the support, the peptide is treated with glutaric anhydride in the presence of triethylamine. The peptide is then purified and the -COOH group carried by the glutaryl at the N-terminal end is activated by a mixture of diisopropylcarbodiimide and 1-hydroxybenzotriazole. After two hours' reaction at room temperature, the doxorubicin is added and the mixture is stirred for 12 hours at 0°C. The peptide-doxorubicin conjugate is then purified by high pressure liquid chromatography.

Example 2 : Ability of the linear peptides of the 
invention to pass through cell membranes. 

1) Cell models

The ability of the peptides to pass through the membranes was tested on various cell types (MCF7, MCF7R, HL60, HL60R, HeLa).

The cells are cultured at 37°C in RPMI 1640 (Gibco) to which are added 10 % (v/v) fetal calf serum, 2 mM glutamine and 2 mM penicillin/streptomycin. 30,000 cells are seeded in Lab Tek chambers and cultured for 1 day.

2) Treatment with linear peptides-biotin prepared according to example 1 (2)

The cells are incubated in Opti-Mem (Gibco) for one hour before being treated for variable time periods with biotin-labelled peptides.

The latter are obtained in accordance with example 1 (2) by treating 1 equivalent of linear peptide with 2 equivalents of N-hydroxysuccinimide biotin ester, then purified by high pressure liquid chromatography.

The cells are then fixed with a 3.7 % solution of paraformaldehyde for 5 minutes at 25°C, then rinsed three times with PBS. They are then permeabilised with 0.1 % Triton (1 min, room temperature). After three rinsings in PBS, the cells are incubated for 10 min with 200 µl of TexRed antibodies diluted to 1/300 and rinsed three times in PBS. The slides are finally mounted with a Mowiol-Dabco solution and observed under an Axiophot photomicroscope.

3) Treatment with linear peptides-doxorubicin
prepared in accordance with example 1 (3)

The cells are incubated for 15 minutes, then
rinsed with PBS and the doxorubicin present in the cell
is determined by chromatography.

4) Results

a) Among the peptides studied, those which pass
the most easily through the membranes are those with the
following formulas :

Arg Xaa Xaa Arg Xaa Uaa Xaa Uaa Arg Arg Arg Xaa
Uaa Xaa Uaa Xaa Xaa Arg-NH2 (V) (SEQ ID NO : 13)

Arg Arg Xaa Uaa Xaa Arg Xaa Uaa Xaa Arg Xaa Xaa
Uaa Xaa Arg Arg Uaa Arg-NH2 (VI) (SEQ ID NO : 14)

in which

- Uaa represents serine or threonine, and

- the Xaa groups, identical or different,
represent an amino acid which may or may not be natural
(including D-amino acids), either aliphatic or aromatic,
such as glycine, alanine, valine, norleucine, isoleucine,
leucine, cysteine, cysteine, penicillamine,
methionine, serine, threonine, asparagine, glutamine,
phenylalanine, histidine, tryptophan, tyrosine, proline,
aminobutyric acid, 1-aminocyclohexanecarboxylic acid,
aminoisobutyric acid, 2-aminotetralin carboxylic acid,
4-bromophenylalanine, tert-leucine, 4-chlorophenylalanine,
β-cyclohexylalanine, 3,4-dichlorophenylalanine,
4-fluorophenylalanine, homoleucine, β-homoleucine,
homophenylalanine, 4-methylphenylalanine,
1-naphthylalanine, 2-naphthylalanine, 4-nitrophenylalanine,
3-nitrotyrosine, norvaline, phenylglycine,
3-pyridylalanine, β-(2-thienyl)-alanine.
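
To make the reading of formulas (V) and (VI) concrete, the following minimal Python sketch checks whether a given three-letter sequence fits one of the two templates. It is an illustration under stated assumptions: the names `matches`, `UAA` and `XAA` are hypothetical, the C-terminal amide is ignored, and Xaa is restricted here to the natural residues of the list above, whereas the definition also admits non-natural and D-amino acids. The test sequence is the peptide part of the biotin complex described under (c) (SEQ ID NO : 15).

```python
# Templates (V) and (VI) as lists of positions; the C-terminal -NH2 is ignored.
FORMULA_V = ("Arg Xaa Xaa Arg Xaa Uaa Xaa Uaa Arg Arg Arg Xaa "
             "Uaa Xaa Uaa Xaa Xaa Arg").split()
FORMULA_VI = ("Arg Arg Xaa Uaa Xaa Arg Xaa Uaa Xaa Arg Xaa Xaa "
              "Uaa Xaa Arg Arg Uaa Arg").split()

# Uaa represents serine or threonine.
UAA = {"Ser", "Thr"}

# Assumption: only the natural residues of the Xaa list are accepted here.
XAA = {"Gly", "Ala", "Val", "Ile", "Leu", "Cys", "Met", "Ser", "Thr",
       "Asn", "Gln", "Phe", "His", "Trp", "Tyr", "Pro"}


def matches(formula, sequence):
    """Return True if the space-separated three-letter sequence fits formula."""
    residues = sequence.split()
    if len(residues) != len(formula):
        return False
    for slot, residue in zip(formula, residues):
        if slot == "Uaa":
            ok = residue in UAA
        elif slot == "Xaa":
            ok = residue in XAA
        else:
            ok = residue == slot  # fixed position, e.g. Arg
        if not ok:
            return False
    return True


SEQ_15 = ("Arg Gly Gly Arg Leu Ser Tyr Ser Arg Arg "
          "Arg Phe Ser Val Ser Val Gly Arg")

if __name__ == "__main__":
    print("fits (V): ", matches(FORMULA_V, SEQ_15))
    print("fits (VI):", matches(FORMULA_VI, SEQ_15))
```

Under these assumptions the sequence fits template (V) but not template (VI), since position 2 of (VI) is a fixed arginine.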

b) The results of the experiments conducted with
doxorubicin show a significant increase in the plasma and
nuclear concentration of doxorubicin when the latter is
coupled with the linear peptide of the invention compared
with the use of doxorubicin alone.

c) The experiments with biotin were conducted
more especially on MCF7 cells treated for different times
with a complex of biotin and a peptide of the invention
having the formula :

biotin-Arg Gly Gly Arg Leu Ser Tyr Ser Arg Arg
Arg Phe Ser Val Ser Val Gly Arg-NH2 (SEQ ID NO : 15)

This work was photographed (not shown) :

- Control in which the cell was treated with
biotin alone,

- Treatment of the cell for 2 minutes with a
complex : biotin-linear peptide of the invention,

- Treatment of the cell for 30 minutes with a
complex : biotin-linear peptide of the invention.

It can be seen in these photographs that biotin
alone does not enter the cell and accumulates weakly
around the cell. Conversely, with the complex of the
invention, it can be seen that the biotin is rapidly
carried by the linear peptide of the invention into the
cell, where it is present in the cytoplasm and cell
nucleus.

Example 3 : Internalisation ability of the linear 
peptides of the invention 

Linear peptides of the invention derived from
Protegrins and Tachyplesins were tested on different cell
lines for the purposes of assessing their respective
internalisation.

1) Experimental conditions

The cells were seeded at approximately 10⁴ cells
per dish, 24 h before the addition of biotinylated
peptides. On the day of the experiment confluence was
60-80%. The biotinylated peptides are incubated with the
cells at a concentration of 10 µM for 15 minutes at 37°C
in an atmosphere of 95% humidity and 5% CO2 in OptiMem
medium. The cells are washed three times with PBS at room
temperature and are then fixed with formalin (3.7%
formaldehyde in PBS, 10 min at room temperature). They
are then washed in PBS and permeabilised for 15 min with
PBS-TritonX-100. Staining is developed with streptavidin-
Texas-Red for 15 min away from light and the cells are
then mounted on slides. They are observed under a
fluorescence microscope and compared with a positive
control (Ap43-58), well described in the literature, and
with a negative control.

The cell nuclei were stained with Hoechst.
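
As a small worked example of the incubation conditions, the sketch below applies the usual dilution relation C1 x V1 = C2 x V2 to find the volume of peptide stock needed to reach a 10 µM final concentration. The stock concentration (1 mM) and the volume of medium per dish (1000 µl) are hypothetical values chosen for illustration; neither is given in the example.

```python
def stock_volume_ul(final_conc_um, final_volume_ul, stock_conc_um):
    """Volume of stock (ul) to add so that the final volume has final_conc_um.

    Uses C1 * V1 = C2 * V2, with all concentrations in micromolar.
    """
    if stock_conc_um <= final_conc_um:
        raise ValueError("stock must be more concentrated than the target")
    return final_conc_um * final_volume_ul / stock_conc_um


if __name__ == "__main__":
    # Hypothetical: 1 mM (1000 uM) stock, 1000 ul of medium per dish.
    print(stock_volume_ul(10, 1000, 1000), "ul of stock per dish")
```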

2) Cell lines 

All the lines tested are of human origin and were
commercially obtained from ATCC.

Non-tumoral lines : MRC5 (lung fibroblast),
HuVeC (endothelial, umbilical cord)

Tumoral lines : HT29 (colon carcinoma), HepG2
(hepatoblastoma), A172 (glioblastoma), HMCB (melanoma).

The cells are cultured at 37°C in an atmosphere
of 95% humidity and 5% CO2. The culture medium is the one
recommended by ATCC.

3) Tested peptides

The two series of tested peptides are those given
in tables I and II.

4) Results 

The internalisation results are shown in tables
III and IV below. The peptides penetrate the cells with
different degrees of internalisation. Some (such as
SM1739 and SM2190) are not internalised, whereas others
(such as SM2307, SM2187, etc.) penetrate with good
efficacy. We also observed that some peptides enter
a given cell type more readily than others. SM2196, for
example, has better internalisation in tumoral cells
(HepG2, A172 and HT29) than in non-tumoral cells (MRC5
and HuVeC). Conversely, the SM1738 peptide has greater
penetration in non-tumoral lines than in tumoral lines.
These results suggest the existence of cell tropism.

Generally it would appear that the retro form of
the lead peptide of each series does not significantly
modify internalisation. Increased hydrophobicity has a
negative effect for both families of tested peptides. It
is therefore advisable to avoid increasing
hydrophobicity. On the other hand, an increase in
amphipathicity seems to have a positive effect, at least
for the Protegrin family.

Table III : Protegrin derivatives

Peptide   HepG2   A172    HMCB    HuVeC   MRC5    HT29    Internalisation
SM1738    +       +       +       +++     +++     +       Reference
SM1727    0       ++      ++      +++     +       +       No significant effect
SM1736    ++      +       +++     ++++    ++++            No significant effect
SM1739    0       +       +       0       0       0       Negative effect
SM2187    +++     +++     ++++    +++     ++++    +++     Positive effect
SM2189    +++     ++      +++     ++      ++++    ++      Positive effect
SM2188    0       0       0       ++      0       0       Negative effect
SM2193    ++      ++      +++     ++      0       0       Negative effect
SM2194    0       +       +++     +       +       0       Negative effect
SM2195    ++++    0       +++     +       +       ++++    Contradictory
SM2196    ++++    ++++    ++      +               ++++    Tropism



Fluorescence microscopy photographs of
internalisation are shown in figures 1 and 2. In the A172
and HT29 lines, the SM1738 peptide, shown as an example,
appears to be mainly localised in the cytoplasm and in a
perinuclear zone. For the HuVeC line, the peptide is
mainly localised in the cytoplasm. The left column
corresponds to nucleus staining with Hoechst.
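
The tropism noted for SM2196 and SM1738 can be made explicit from the Table III entries. The following minimal Python sketch converts each entry to the number of "+" signs and averages it over tumoral and non-tumoral lines. Treating the "+" counts as a numeric scale is an assumption made for illustration only, since the scores are semi-quantitative; the names `SCORES` and `mean_score` are hypothetical, and the entry that is empty in the table (SM2196 on MRC5) is stored as `None` and skipped.

```python
TUMORAL = ("HepG2", "A172", "HMCB", "HT29")
NON_TUMORAL = ("HuVeC", "MRC5")

# Entries copied from Table III; None marks a cell left empty in the table.
SCORES = {
    "SM1738": {"HepG2": "+", "A172": "+", "HMCB": "+",
               "HuVeC": "+++", "MRC5": "+++", "HT29": "+"},
    "SM2196": {"HepG2": "++++", "A172": "++++", "HMCB": "++",
               "HuVeC": "+", "MRC5": None, "HT29": "++++"},
}


def mean_score(peptide, lines):
    """Average number of '+' signs over the given lines (None if no data)."""
    values = [SCORES[peptide][line].count("+")
              for line in lines
              if SCORES[peptide][line] is not None]
    if not values:
        return None
    return sum(values) / len(values)


if __name__ == "__main__":
    for name in SCORES:
        print(name,
              "tumoral:", mean_score(name, TUMORAL),
              "non-tumoral:", mean_score(name, NON_TUMORAL))
```

On this crude scale SM2196 scores higher on the tumoral lines and SM1738 higher on the non-tumoral lines, in line with the tropism described in the results.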



Table IV : Tachyplesin derivatives

Peptide   HepG2   A172    HMCB    HuVeC   MRC5    HT29    Internalisation
SM1726    +++     +       +++++   +++     +++     +++     Reference
SM2310    ND      ++      ++++    +++     ++      +++     No effect
SM2309    ND      ++++    ++      ++      ++++    ++++    ND
SM2191    ++      ++      ++      ND      +++     +++     No effect
SM2192    +       +++     ++++    +++     ++++    ++      No effect
SM2190    0       0       0       0       0       0       Negative effect
SM2307    ND      +++++   +++++   ++++    ++++    +++++   Positive effect
SM2392    ND      +++     ++++    ++      +++     ++++    No effect

ND = not determined



The internalisation photographs are shown in
appended figures 3 and 4. For the 3 cell lines shown
(A172, HT29, HuVeC) the biotinylated peptide is localised
in the cytoplasm in diffuse manner and also distinctly
labels the nucleolus. The left column corresponds to
nucleus staining with Hoechst.

Example 4 : Internalisation of vectored doxorubicin

The cells are seeded at approximately 10⁴ cells
per dish 24 h before the addition of the products. On the
day of the experiment confluence is 60-80%. The free
doxorubicin or the doxorubicin coupled to the SM1738
vector is incubated with the MCF7 cells at a
concentration of 10 µM for 60 minutes at 37°C in an
atmosphere of 95% humidity and 5% CO2 in the culture
medium. The subcellular localisation of doxorubicin,
which is naturally fluorescent, was determined by
confocal microscopy. The results are given in appended
figure 5.

The localisation is partly cytoplasmic and partly
nuclear. The nucleus in this case is labelled in diffuse
manner.

In the peptide sequences listed below, the amino
acids are represented by their one-letter code, but they
may also be represented by their three-letter code
according to the following nomenclature :

A    Ala    alanine
C    Cys    cysteine
D    Asp    aspartic acid
E    Glu    glutamic acid
F    Phe    phenylalanine
G    Gly    glycine
H    His    histidine
I    Ile    isoleucine
K    Lys    lysine
L    Leu    leucine
M    Met    methionine
N    Asn    asparagine
P    Pro    proline
Q    Gln    glutamine
R    Arg    arginine
S    Ser    serine
T    Thr    threonine
V    Val    valine
W    Trp    tryptophan
Y    Tyr    tyrosine
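
The correspondence between the two notations can be applied mechanically. The following minimal Python sketch, given purely as an illustration (the function names are hypothetical), converts a sequence between one-letter and three-letter codes using the nomenclature above. It covers only the twenty residues of that list and ignores terminal groups such as -NH2.

```python
# One-letter to three-letter codes, as in the nomenclature above.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}
THREE_TO_ONE = {three: one for one, three in ONE_TO_THREE.items()}


def to_three_letter(sequence):
    """Convert e.g. 'RGG' to 'Arg Gly Gly' (KeyError on unknown letters)."""
    return " ".join(ONE_TO_THREE[letter] for letter in sequence.upper())


def to_one_letter(sequence):
    """Convert e.g. 'Arg Gly Gly' to 'RGG' (KeyError on unknown residues)."""
    return "".join(THREE_TO_ONE[residue] for residue in sequence.split())


if __name__ == "__main__":
    # Peptide part of SEQ ID NO : 15, without the biotin and the C-terminal amide.
    seq_15 = ("Arg Gly Gly Arg Leu Ser Tyr Ser Arg Arg "
              "Arg Phe Ser Val Ser Val Gly Arg")
    print(to_one_letter(seq_15))
```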



